Regularized Discriminant Analysis and Its Application in Microarray
نویسندگان
چکیده
In this paper, we introduce a family of some modified versions of linear discriminant analysis, called “shrunken centroids regularized discriminant analysis” (SCRDA). These methods generalize the idea of the nearest shrunken centroids of Prediction Analysis of Microarray (PAM) into the classical discriminant analysis. These SCRDA methods are specially designed for classification problems in high dimension low sample size situations, for example microarray data. Through both simulation study and real life data, it is shown that these SCRDA methods perform uniformly well in the multivariate classification problems, especially outperform the currently popular PAM. Some of them are also suitable for feature elimination purpose and can be used as gene selection methods. The open source R codes for these methods are also available and will be added to the R libraries in the near future.
منابع مشابه
Regularized Discriminant Analysis and Its Application in Microarrays
In this paper, we introduce a modified version of linear discriminant analysis, called “shrunken centroids regularized discriminant analysis” (SCRDA). This method generalizes the idea of “nearest shrunken centroids” (NSC) [Tibshirani et al., 2003] into the classical discriminant analysis. The SCRDA method is specially designed for classification problems in high dimension low sample size situat...
متن کاملRegularized linear discriminant analysis and its application in microarrays.
In this paper, we introduce a modified version of linear discriminant analysis, called the "shrunken centroids regularized discriminant analysis" (SCRDA). This method generalizes the idea of the "nearest shrunken centroids" (NSC) (Tibshirani and others, 2003) into the classical discriminant analysis. The SCRDA method is specially designed for classification problems in high dimension low sample...
متن کاملIncorporating prior knowledge of gene functional groups into regularized discriminant analysis of microarray data
MOTIVATION Discriminant analysis for high-dimensional and low-sample-sized data has become a hot research topic in bioinformatics, mainly motivated by its importance and challenge in applications to tumor classifications for high-dimensional microarray data. Two of the popular methods are the nearest shrunken centroids, also called predictive analysis of microarray (PAM), and shrunken centroids...
متن کاملGD-RDA: A New Regularized Discriminant Analysis for High-Dimensional Data
High-throughput techniques bring novel tools and also statistical challenges to genomic research. Identification of which type of diseases a new patient belongs to has been recognized as an important problem. For high-dimensional small sample size data, the classical discriminant methods suffer from the singularity problem and are, therefore, no longer applicable in practice. In this article, w...
متن کاملModified linear discriminant analysis approaches for classification of high-dimensional microarray data
Linear discriminant analysis (LDA) is one of the most popular methods of classification. For high-dimensional microarray data classification, due to the small number of samples and large number of features, classical LDA has sub-optimal performance corresponding to the singularity and instability of the within-group covariance matrix. Two modified LDA approaches (MLDA and NLDA) were applied for...
متن کامل